Loops in One Dimensional Random Walks 
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Distribution of loops in a one-dimensional random walk (RW) , or, equivalently, neutral segments 
in a sequence of positive and negative charges is important for understanding the low energy states 
of randomly charged polymers. We investigate numerically and analytically loops in several types 
of RWs, including RWs with continuous step-length distribution. We show that for long walks 
the probability density of the longest loop becomes independent of the details of the walks and 
definition of the loops. We investigate crossovers and convergence of probability densities to the 
limiting behavior, and obtain some of the analytical properties of the universal probability density. 
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I. INTRODUCTION 

One reason for the growing interest in polymers M 
is the desire to understand long chain biological macro- 
molecules, and especially proteins. An important class 
of polymers is poly ampholytes (PAs) ||, which are het- 
cropolymers that carry a mixture of positive and negative 
charges. In recent years, much attention has been given 
to the ground state conformations of randomly charged 
PAs d^j- The study of randomly charged PAs sug- 
gests that their ground state has a structure similar to 
a necklace, made of neutral or weakly charged parts of 
the chain, compacting into globules, connected by highly 
charged stretched strings B. This structure is a com- 
promise between the tendency to reduce the surface area 
due to surface tension, and the tendency to expand due 
to the Coulomb repulsion of the total (excess) charge. 
A complete analytical characterization of this structure 
within the necklace model (which was obtained for homo- 
geneously charged polymers Q), was so far not obtained. 

Since a key role in the structure of randomly charged 
PAs is played by the neutral segments in the chain (form- 
ing the beads in the necklace), Monte Carlo (MC) meth- 
ods were applied to study their size distribution |3]-||. 
This problem can be investigated by mapping the charge 
sequence of the PA into a one-dimensional (1-d) RW. 
The charge sequence u — {qi} (i — 1, ...,7V; qi = ±1) 
is mapped into a sequence of positions Si(u>) — J2)=i 9j 
(Sq = 0) of a random walker (the charges are measured 
in units of the basic charge, and therefore qj is dimen- 
sionless). The random sequence of N charges is thus 
equivalent to an iV-step RW, a chain segment with an 
excess charge Q corresponds to a RW segment with to- 
tal displacement of Q steps, and a neutral segment is 
equivalent to a loop inside the RW (see Fig. ^]). 

Kantor and Erta§ [§-§ attempted to quantify the neck- 
lace model, by postulating that the ground state of a PA 
will consist of a single globule, formed by the longest 
neutral segment of the PA, and a tail, formed by the 
remaining part. However, when the longest neutral seg- 
ment forms a globule, the typical tail is very large (~ N). 
Since the tail itself may contain neutral segments, it will 



further fold into globules, in order to reduce the total en- 
ergy . This structure is depicted in Fig. EH: The longest 
neutral segment contains L monomers; it compacts into a 
globule of linear size proportional to L 1 ' 3 ; in the remain- 
ing part of the chain the longest neutral segment (the 
2nd longest neutral segment) of size L 2 compacts into a 

1/3 

globule of radius L 2 ' , then the 3rd and so on, until the 
segments become very small (of only a few monomers). 
Eventually, all the neutral segments are exhausted and 
we are left only with strings which carry the PA's excess 
charge Q, and connect the globules. 
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FIG. 1. An example of a charge sequence uj (top), mapped 
into a 1-d RW Si(uj) (middle), and a typical loops structure (bot- 
tom). Filled circles indicate the starting and ending points of 
loops. The longest loop in the RW has 8 steps (L = 8), the excess 
charge (which is equivalent to the total displacement of the RW) is 
Q = +8, and the total length is iV = 28. 

There are several models in which a charge sequence 
is broken into neutral segments (similar to a RW bro- 
ken into loops) Jul. The relation between these models 
and the model described above was detailed in a previous 
work , but none of them consider issues of the longest 
loops. 



In this work we investigate the probability density p(l) 
of a longest loop in an AT-step RW to be of reduced length 
I = L/N, where L is the number of steps in the loop. 
This probability density was extensively studied |6|~|9|, 
and some of its properties were found analytically. A nu- 
merical evidence for the ./V-independence of p(l) in the 
N — > oo limit was presented, but a complete analyti- 
cal description of it was not found. We present numeri- 
cal and analytical evidence that this probability density 
becomes, for long walks, 'universal' for a large class of 
RWs. In order to demonstrate this, we define in section 
U the problem of the longest loop for RWs with continu- 
ous step-length distribution, and in particular we define 
what is called a loop. We then investigate numerically 
the crossover from finite N to A~ — ► oo for different pa- 
rameters defining a loop. In section III we demonstrate 
the 'universality' by showing that the probability density 
of the longest loop becomes, for A" — > oo, independent of 
N, and of the parameter defining a loop, and is the same 
for both RWs with steps of fixed length and steps of con- 
tinuous length distribution. In section IV we investigate 
this 'universal' probability density, and obtain some of 
its properties. 



II. LONGEST LOOPS IN RANDOM WALKS 
WITH CONTINUOUS STEPS 

In this section, unlike the models mentioned in the pre- 
vious section, we consider the problem of longest loops 
in RWs with continuous step-length distribution. Ac- 
cording to the central limit theorem |ll[ , the end to end 
distance of a RW, in which the steps are uncorrelated 
and have finite variance, approaches a Gaussian proba- 
bility distribution with increasing number of steps. In a 
long RW we can group a number of adjacent steps into 
a 'rescaled step'. If we repeat this rescaling process, the 
probability distribution of a rescaled step length will ap- 
proach a Gaussian form. It is, therefore, convenient to 
start with Gaussian elementary steps, distributed accord- 

ing to probability density f(x) = e 2^/av27r, since the 
rescaling process does not modify the functional form of 
the distribution of a step (except for increased variance) . 
We shall denote such a RW, as a Gaussian RW. Although 
we shall address in this study Gaussian RWs, most of the 
results will be more general, and valid for large class of 
RWs with continuous step-length distribution. In Gaus- 
sian RWs, the probability density of the position x of the 
'random walker' after N steps is: 



f(x,N) 



1 



ay/2nN 



(1) 



where a 2 is the variance of a single step. Since a RW with 
continuous distribution of steps never exactly returns to 
a previously visited position, we say that a loop is closed 
if two steps in the RW are closer than e from each other. 
Therefore, the probability of having a longest loop of L 



steps in such a RW depends on the number of steps N, 
on the standard deviation of the Gaussian step a, and on 
e defined above. From dimensional arguments, it is clear 
that the probability of the longest loop depends on e and 
a only through e/a. 

As in the problem with fixed step length, it is con- 
venient to work with a probability density of the longest 
loop, and to explore it as a function of the reduced length 
I. The probability density is defined by p(l;N,e/a) — 
N ■ P(L; N, e/a), where P(L; N, e/a) is the probability of 
having a longest loop of L steps in a RW with given a 
and N, when the distance defining a loop is e. 




0.3 0.4 0.5 0.6 0.7 

Length of longest loop - l=L/N 
FIG. 2. p(l; N,e/a) vs. I for several values of e/a: 10 (1), 
1CT 4 (2), 1CT 3 (3), 1CT 2 (4), 0.1 (5), 1 (6), 10 (7). All graphs show 
MC results of 10 6 randomly selected sequences of length N = 300. 



For a RW with steps of fixed length, it was shown nu- 
merically |7|] that the probability density of the longest 
loop becomes ./V— independent already for modest values 
of N. However, the probability density of the longest 
loop for a RW with continuous step length distribution 
strongly depends on e. This dependence is depicted in 
Fig. 0, showing p(l; N, e/a) for values of e/a ranging from 
10~ to 10, and for N = 300. We now investigate this de- 
pendence: The positions of steps of an A^— step RW are 
spread out over a distance of order ayN . The typical 
probability of a loop between two given steps for small e 
is of order of e divided by the entire spread of the RW, 
Since there are ~ N 2 pairs of steps which can 



/N 



form a loop, the mean total number of loops in the e — > 
limit is of order of N 3 ' 2 e/a. In the approximation of 
having no more than one loop per chain (N 3 / 2 e/a < 1), 
it is possible to find p(l; N, e/a) analytically. Neglecting 

terms O (e/a) , the probability density in this single 

loop (s.l.) approximation is: 



p s . h (l;N,e/a) 



2 AT3/2 £ 1-1 

- a Si 



1 N 3/2JLU(1) . 



(2) 



We see that when e/a <C 1/TV 3 / 2 no loops are formed, 
and p(l; N,e/a) ~ 5(1). At the opposite extreme, when 
e/a ^ VTV, every step closes a loop which originates 
at almost all the other steps in the walk, and therefore 
the first step generates a loop with the last, resulting in 
p(l; TV, e/a) ~ 6(1 — 1). And indeed, in Fig.pl we see that 
graph 1, representing small e/a, has a strong divergence 
at I = 0, while graph 7 shows the signs of strong di- 
vergence at I = 1 (although e/a is not very big in this 
case). The intermediate values of e/a exhibit crossover 
shapes. In particular we note that graphs 5 and 6, where 
e/a = 0.1 and 1 respectively, have a shape closely resem- 
bling the probability density p(l) of the longest loop in 
RWs with steps of fixed length J7]|§, in the large TV limit. 

We investigated the TV— dependence of p(l; TV, e/a), and 
obtained different qualitative behaviors for different val- 
ues of e/a. For fixed e/a < 1 the function p(l;N,e/a) 
changes gradually from 5(1) towards p(l) as TV increases. 
This behavior resembles that in Fig. (constant TV and 
changing e/a), in a way that increasing TV (for constant 
e/a) is equivalent to increasing e/a (for constant TV). For 
fixed e/a > 1, the function p(l;N,e/a) changes from 
5(1 — 1) towards p(l) as TV increases, in a way that in- 
creasing TV (for constant e/a) is equivalent to decreasing 
e/a. At e/a ~ 1, the probability density p(l; N,e/a) is 
almost independent of TV, and converges very quickly to 
p(l). 

The qualitative arguments of the previous paragraph 
indicate that for small values of e the function p(l; TV, e/a) 
depends on TV and e/a only through N 3 ' 2 e/a. Numerical 
comparison of several probability densities, having differ- 
ent values of TV and e/a but the same value of TV 3 / 2 e/a, 
confirms this dependence. In the opposite limit e/a 3> 1, 
we can find through qualitative rescaling arguments a 
single parameter for the dependence of p(l; TV, e/a) on TV 
and e/a. In order to represent a chain with given TV, e 
and a as a chain with TV* 'effective steps' and e* /a* = 1, 
we group n = (e/a) steps of the original chain into an 
'effective step' a* = a^/n = a^J(e/a) 2 = e (where e* = e). 
Since the probability of having a loop of certain length 
(for loops longer than (e/a) steps in the original chain) 
is the same in both chains, then any chain with TV steps 
and e/a > 1 can be divided to a chain with TV* = , ^ .% 
steps and e* /a* — 1 (if e/a is large enough, so that a 
rescaling of (e/a) steps is meaningful, but is neverthe- 
less smaller than y/N, where TV* becomes unity). It is 
therefore reasonable to assume that for 1 <C - < V~N, 
the probability density of the longest loop depends only 
on TV /(e/a) 2 . Numerical comparison of several probabil- 
ity densities, having different values of TV and e/a but the 
same value of N /(e/a) 2 , confirms this dependence. 

We want to investigate the dependence of the shapes 
of the different graphs of p(l; TV, e/a) on e/a and TV, and 
find a fixed point. Since an infinite number of parame- 
ters is required to characterize a graph, we simplify the 
investigation by characterizing each graph by a single pa- 
rameter (I) - the average reduced length of the longest 



loop. Since the functions p(l; TV, e/a), depicted in Fig.g, 
are normalized functions which change gradually from 
5(1), where (1) = to 6(1 - 1), where (I) = 1, then (I) 
provides a reasonable characterization of the entire prob- 
ability density for all I. Therefore, the fixed point for the 
entire graph of the function, is obtained when the aver- 
age (I) does not change with TV and e/a. The dependence 
of (I) on e/a and TV is depicted in Fig.y. The curves in 
Fig. are lines in the 'e — TV' plane (on a logarithmic 
scale), along which (I), (calculated by p(l;N,e/a) from 
10 6 random sequences) has constant value. 
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FIG. 3. Lines along which (I) is constant (see text), as a func- 
tion of N and of e/a on a logarithmic scale (TV = 30 to 1000 and e/a 
= 0.001 to 10). The labels above the lines are the average lengths 
(I) along them. The values of (I) along adjacent lines differ by 0.05. 

It is evident from Fig. f| that the curves from both sides 
of the e/a ~ 1 line are not identical, confirming that the 
dependence oi p(l; TV, e/a) on e/a is different in value be- 
tween e/a < 1 and e/a > 1. The value e/a ~ 1 is the only 
value of e/a for which the probability density is (almost) 
the same for all TV (resulting in (I) ~ 0.76). Furthermore, 
it is evident from the figure that as TV increases, (I) ap- 
proaches 0.76 for all values of e/a. It can be shown that 
as (I) approaches 0.76, the probability density p(l; TV, e/a) 
becomes very similar to p(l). We therefore conclude, that 
as TV — > oo, the probability density p(l; TV, e/a) becomes 
independent of the values of e, a and TV, and very similar 
to p(l). 

In order to demonstrate that the obtained numerical 
results are not just an effect of a specific definition of 
what is called a loop, and are typical for RWs with con- 
tinuous step-length distribution, we repeated the numer- 
ical processes for several other definitions of a loop. For 
instance, we examined the case where the distance defin- 
ing a loop is a random variable, having different values for 
every pair of steps. The results obtained for these defini- 
tions were very similar to those detailed above. Similar 
results for several definitions of a loop lead us to the no- 
tion that the probability density of the longest loop is 
'universal', not just in the sense that it does not depend 



on the values of e and a, but that it does not depend on 
the details of the definition of what is called a loop. 



The probability density of the longest loop is obtained 
by finding the longest loop for many independent RWs 
of to 'effective steps'. 



III. UNIVERSALITY OF THE PROBABILITY 
DENSITY OF THE LONGEST LOOP 

Motivated by the numerical results of the previous sec- 
tion, we try to demonstrate in a more exact way, in what 
sense the probability density of the longest loop is uni- 
versal. We first show that for an A-step RW with steps 
of fixed length (discrete RW) this probability density be- 
comes independent of N for large N. We then generalize 
our results to continuous Gaussian RWs. 



A. Discrete Random Walks 

In order to show that the probability density of the 
longest loop for discrete RWs becomes (for long walks) 
independent of N, we perform a rescaling process: We 
divide a given long RW with steps of fixed length into m 
equal sub- walks, which are 'effective steps' in the rescaled 
walk. Each of the to sub-walks has a minimum and a 
maximum position of steps inside it. A loop between two 
sub-walks is defined as a loop in the original RW, which 
starts at a step in one sub-walk, and ends at a step in 
the second sub-walk. Such a loop is formed when the 
two sub-walks 'intersect' each other (both of them reach 
the same position). This intersection occurs if either the 
minimum or maximum of one sub-walk is between the 
minimum and maximum of the other sub-walk. 

The distribution of these minima and maxima depends 
only on the positions of the first and last steps of each 
sub- walk, and therefore, the probability of a loop between 
any two given sub-walks can be calculated analytically 
for any value of N and to. As N — > oo the functional 
forms of the distributions of minima and maxima become 
simple, and we therefore perform our calculation in that 
limit. Appendix describes the calculation of a joint 
probability density g(w, W; x) of a given RW, ending at 
a position x relative to its origin, to have a minimum — w 
and a maximum W. This probability density is valid for 
both discrete RWs and continuous Gaussian RWs, and is 
depicted in Fig.[|. 

However, the calculation of the probability of a loop 
between two sub-walks does not enable to find the prob- 
ability density of the longest loop, since it disregards the 
dependencies between the probabilities of loops. It is 
therefore convenient to turn to numerical MC methods: 
We numerically generate independent RWs of m 'effec- 
tive steps', where the distances between their positions 
are distributed according to a Gaussian probability. Each 
effective step is assigned random values of a minimum — w 
and a maximum W , according to g(w, W; x), where x is 
the distance between this step and the next. We can thus 
obtain all the loops in the RW, and find the longest loop. 




W = maximum of RW " '" w = -minimum of RW 

FIG. 4. Mutual probability density of having a RW with a min- 
imum —w and a maximum W: g(w, W; X = 1) vs. w and W. All 
lengths are measured in units of the standard deviation of the end 
position of the RW (see appendix W) . 

The probability density of the longest loop, emerging 
from the described rescaling process, converges with in- 
creasing to to the N — > oo limit of the probability density 
p(l) of the longest loop for discrete RWs: Apparently, the 
probability density emerging from the rescaling process 
should be equal to p(l), since a loop in the rescaled RW is 
generated if and only if there is a loop in the original dis- 
crete RW. However, given a loop between two segments in 
the rescaled chain, it is impossible to know where in the 
segments were the 'original' steps which generated the 
loop, and therefore, for each loop in the rescaled chain 
there is an inherent inaccuracy of up to (plus or minus) 
one segment. As to — ► oo this inaccuracy vanishes. This 
convergence is evident from Fig. p. We denote the proba- 
bility density of having a longest loop of length I = L/m, 
in a RW of to 'effective steps', by / m (Z). We have nu- 
merically obtained f m (l) for m=4, 10, 20, 50 and 100 
(each from 10 5 independent random sequences of length 
to), and compared them to p{l), obtained by MC simula- 
tion of 10 6 random sequences of N — 1000 discrete steps. 
These functions are depicted in Fig.0. For each value 
of to, the possible values of I are —,—,•••, Hi^i The 
length of the longest loop is never zero, since it can be 
shown that neighboring sub-walks always make a loop, 
and is never equal to m, since a loop from the first to the 
last (roth) step is to— 1 segments long. It is evident from 
Fig. ^| that fm(l) converges very quickly with increasing 
to to p(l). 

The rescaling process described in this section demon- 
strates the N— independence (for large iVs) of the prob- 
ability density of the longest loop, for discrete RWs: 
We can perform this rescaling process to any given dis- 
crete (long) RW, and obtain the same probability den- 



sity of longest loop, independent of the number of steps 
N. Since the rescaling process is statistically exact, the 
probability density of the longest loop, calculated from 
the rescaled RW, converges with increasing number of 
segments to the probability density calculated from the 
given discrete RW. We therefore conclude that the prob- 
ability density of the longest loop for discrete RWs be- 
comes independent of N for large N. 
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Length of longest loop - l=L/N 
FIG. 5. fm{l) vs. I for several values of m: 4 (star), 10 (plus), 
20 (circle), 100 (dot) compared with p(l) (line) as generated by MC 
simulation of 10 6 random sequences of length N = 1000. 



B. Gaussian Random Walks 



We now generalize the results, obtained in the previ- 
ous sub-section for discrete RWs, to continuous Gaussian 
RWs. Since the calculation of the previous sub-section 
was conducted in the N — ► oo limit, the discrete nature 
of the RW does not affect it. The only difference is that 
apparently, two segments in a rescaled Gaussian RW can 
intersect each other, where there is no loop in the origi- 
nal Gaussian RW (this can happen when the positions of 
steps in one sub-RW which makes a segment are distant 
more than e, the distance defining a 'closed loop', from 
the positions of steps in the other segment). We demon- 
strate that such a situation cannot occur: We consider a 
Gaussian RW of N steps of size o, and e defined above. 
We scale the positions of steps of the RW by aV^/V, mak- 
ing the distance between the minimum and maximum of 
order unity. The N steps of the RW are spread along the 
position axis according to some probability density q(y), 
where J . q(y)dy = 1. The average number of steps 
of the RW at a certain position y within the (rescaled) 
interval defining a loop of Ay = e/(ay/N) is given by: 



The y/N dependence means that the average number of 
RW steps within the Ay interval diverges with increasing 
N, for all the positions along the RW, and for all e > 0. 
We see that in the large N limit the entire range along the 
position axis is covered by the 'e— ranges' (i.e. positions 
closer than e) of the steps in the Gaussian RW. Therefore, 
when the minimal or maximal coordinate reached by one 
Gaussian sub-RW is between the minimum and maxi- 
mum of another Gaussian sub-RW, it is always closer 
than e to a position of a certain step in the second sub- 
RW, and a loop is formed in the original Gaussian RW. 
We can therefore perform the rescaling process of the 
previous sub-section on a Gaussian RW, and get a prob- 
ability density of the longest loop, which is identical to 
the probability density for discrete RWs. We have thus 
shown that the probability density of the longest loop (for 
large enough Ns) is independent of N and is the same 
for both discrete RWs and Gaussian RWs, when a loop 
is defined according to an arbitrary e > 0. We therefore 
conclude that this probability density has some measure 
of 'universality'. 



IV. PROPERTIES OF THE 'UNIVERSAL' 
PROBABILITY DENSITY 



In the previous section we numerically obtained the 
probability density of the longest loop through a rescal- 
ing process. Although we cannot find this probability 
density analytically, there are some properties and re- 
lated probabilities that can be found analytically. 

Since a loop between two sub-walks in a discrete RW 
is formed if the two sub- walks intersect, we want to cal- 
culate the probability of such intersection. Fig.[| depicts 
two such sub- RWs (solid lines), which are part of one 
long RW (dashed line). 

position 




steps 



Nq(y)Ay = Nq(y) 



aVN 



N 



(3) 



FIG. 6. Two discrete RWs (solid lines), each of r steps, which 
are sub- walks of a single RW (connecting dashed line) . The origin 
of the second walk is shifted s along the position axis. See text for 
explanations of other labels. 



Each sub- walk is a discrete RW, having r steps of size 
a. (We are interested in the limit where t — > oo and 
a — ► so that ayfr is finite.) The position of the last step 
of the first sub-RW is x, while the second sub-RW begins 
at a position s relative to the origin, and 9 steps (along 
the original RW) after the first sub-RW ends. The two 
sub-walks intersect (forming a loop in the original RW) 
if and only if r, the maximum of one walk (the first walk 
in Fig.o) is greater or equal to z, the minimum of the 
other walk. 

These sub- walks are not independent RWs: Fixing the 
position s of the origin of the second sub-walk affects 
the probabilities of the possible states of the first sub- 
walk (and specifically the probability of the end-step po- 
sition), thus making it not completely random. This 'de- 
pendence' between the sub-walks is stronger when the 
number of steps 9 between them is small. In the ex- 
treme case, of neighboring sub-walks (i.e. 8 — and 
the second sub- walk begins where the first one ends), the 
end-position of the first sub- walk is fixed to be equal s. 
The probability of a loop between two 'dependent' sub- 
walks, separated by k sub-walks along the original RW 
(i.e. 9 — kt in Fig.pL denoted by Prw(«, A = — ^=), is 
derived in appendix |B| and depicted in Fig. [7[ 



a^> 




FIG. 7. Probability of a loop between two sub-walks Prw ( K > A) 
as a function of the normalized distance between their origins 
A = s _ and the number of sub- walks between them n (see text). 
Different lines for different values of k (from top): 1, 3, 10, K —> oo. 

We can now calculate the probability of a loop between 
any two sub- walks in a RW, and therefore the probability 
of having a loop of a given length. However, this calcu- 
lation does not enable us to find the probability density 
of the longest loop, since it disregards the dependencies 
between the probabilities of loops: The fact that there 
is a loop (or that there are no loops) between two other 
sub- walks in the RW, changes the probability Prw(k, A), 
of having a loop between two sub-walks, resulting in an 
expression that we cannot find analytically. 

Although we cannot find an analytical expression for 
p(l), the probability density of the longest loop, we can 



find some of its analytical properties in the limit of long 
loops (i.e. I — > 1). We divide a RW with N steps of 
fixed length a into m segments of r steps, and investi- 
gate /m( 2i — -), the probability of having a loop between 
the first and the last segments. When the last segment's 
origin is shifted s from the origin of the RW, the prob- 
ability of a loop between the first and last segments 

is Prw ( k = m — 2, A = ^= I . In order to obtain the 

probability of a loop between the first and last segments, 
Prw is integrated over all possible values of s with the 
probability of the RW to reach s after m — 1 segments. 
The resulting probability is: 



fn 



m — 1 



1 



m J y/2ir(m-l) 



oAe"2 ^tPrw (m-2, A). 



In the m —* oo limit, Eq. (Q) becomes: 
m — 1 \ 1 



/. 



m J xPh 



dA lim Prw("^, A)c 



(4) 
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(5) 



Substituting l = ^^ in Eq. (||), we get: 

3.2 



fm(lo) 



y/l-lo 



(6) 



The behavior of p(l) in the / 
Kantor and Erta§ 0: 

P(l) 



V^O^O 



1 limit was obtained by 



(7) 



where A is a numerically obtained constant (A = 1.011 ± 
0.001). This means that the probability p of a loop to 
be longer than Iq is (in the Iq — ► 1 limit): 



Pl = [ p(l)dl = -2=Ay/l-l 
Jl=l V 7 *" 



(8) 



It is evident (by comparing Eqs. (g) and (||) ) that the 
I— dependence of f m (l) is in accordance with the behavior 
of p(l) in the / — > 1 limit obtained in 0. 

Any loop longer than m — 1 segments in the RW be- 
gins at the first segment and ends at the last one, and 
is therefore 'counted' by / m ( £2 — -)■ On the other hand, 
all the loops that are between the first and the last seg- 
ments of the RW are longer than 2^— - of the chain. We 
therefore get for all m: 



P ln = ^± < fn 



771 — 1 



< P la = S t=l ■ (9) 



In the m — > oo , Zo — > 1 limit we get by substituting the 
definitions of P k and / m (^) to Eq.*®: 



2 „ 1 3.2 2 „ / 2 

-=,4-= < ; < -f=M — 

\/7r V m v27rm V 71 " V m 



(10) 



random sequences, 
very similar. 



As expected, these probabilities are 



From Eq. (|10| ) we can obtain analytical upper and lower 
bounds on A: 



0.80 < A< 1.13 , 



(11) 



which are satisfied by the known numeric value of A 
1.011 obtained in @. 



V. CONCLUSIONS AND DISCUSSION 

We have defined the problem of the longest loop for 
RWs with continuous step-length distribution, investi- 
gated the resulting probability densities, and showed that 
they converge with increasing number of steps in the RW 
to the probability density of the longest loop in RWs with 
steps of fixed length. These results motivated a rescaling 
process, which enabled us to obtain a probability density 
of the longest loop, which is independent of the number of 
steps and of the nature of the single step of the RW. We 
have presented numerical and analytical evidence, sug- 
gesting that this probability density is identical for both 
discrete and continuous RWs. Investigating this 'uni- 
versal' probability density, we have obtained some of its 
analytical properties in the limit of long loops. It may 
be possible to establish additional analytical properties 
of the problem, through further investigation of this uni- 
versal function. However, a full renormalization-group 
treatment of the problem within the derived rescaling 
process, in order to find a complete analytical solution to 
the problem, is expected to be quite complicated, since 
we are interested in an entire probability density and not 
in a single parameter. 

We note that the entire rescaling process described in 
this work is applicable only to 1-d RWs. Since for RWs 
in dimensions > 2 there is no analog to the minimum and 
maximum (there is no 'boundary' to the RW), we can- 
not perform the rescaling process (for both discrete and 
Gaussian RWs), and the results do not apply to such 
RWs. 

The rescaling process, leading to a 'universal' proba- 
bility density of the longest loop, leads, in a similar way, 
to a 'universal' probability density of the second longest 
loop (and third longest loop, and so on) . The definitions 
and processes of section III can be used in order to ob- 
tain the probability density of the second longest loop: 
When we performed the rescaling process, we first ob- 
tained all the loops in the rescaled RW, and only then 
the longest loop. Therefore, after 'erasing' the longest 
loop, the second longest loop can be obtained, and then 
the third, and so on. In Fig.g the probability density 
(from 10 5 random sequences) of the second longest loop 
of a rescaled chain, having m = 20 segments, is com- 
pared to the same probability density of a discrete RW 
of N = 1000 steps, obtained by MC simulation of 10 6 




0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 

Reduced length of 2nd longest loop 
FIG. 8. Probability density of the second longest loop of a 
rescaled chain of m=20 segments (x), compared with the probabil- 
ity density of the second longest loop of a discrete RW of ./V=1000 
steps (line). 

The 'universality' demonstrated in this study means 
that the probability densities of longest loops are inde- 
pendent of the number of steps and of the nature of the 
single step of the RW. Therefore, by mapping the 1-d 
RW into the charge sequence of a polymer, we see that 
the results obtained in previous works for a specific case 
of randomly charged PAs flq-P] are valid for a larger class 
of randomly charged polymers. 
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APPENDIX A: MUTUAL PROBABILITY OF A 

MINIMUM AND MAXIMUM OF A RANDOM 

WALK 

We present the derivation of the mutual probability of 
the minimum and maximum of a RW. The probability of 
the position of a 'random walker' satisfies the diffusion 
equation, and the problem of a 'random walker' bound 
between a minimum and a maximum can be represented 
as a diffusion between absorbing walls |ll|,[l2j| . We solve 
the diffusion equation in one dimension fL3|: 



a 2 d 2 f{x,t) df(x,t) 



dx 2 



<:)t 



(Al) 



for the probability density f(x 1 1) of a particle starting at 
the origin and taking steps of size a to be at a position 
x after t steps, with boundary conditions of vanishing / 
at — w and W. The eigenfunctions are of the form: 



nir(w + x)\ 1/ m, \ 

sin I — ) e 2 V »+»' > 

w + W 



(A2) 



and the prefactors are determined from the initial condi- 
tion /(x, t = 0) = 5{x). The resulting probability density 
(with respect to x) f w ,w(x, t) of a RW to be at a position 
x after t steps, when there are absorbing walls at —w and 
W, is given by: 



fw,w(x,t) = 

2 v-* 00 • / 
w+W 2~,n=l Sm ^ 



w+W 



n^ w +x) \ e -K^+w) 2a2 * 



w+W 



(A3) 



We are interested in the conditional probability of a given 
RW to have a maximum lower than W, and a minimum 
higher than — w. This conditional probability is equal 
to the derived probability density fw,w{x,t), divided by 
the probability density f(x, t) of the RW to be at a po- 
sition x after t steps. When measuring all distances (i.e. 
w, W and x) in units of ayt, we get for the conditional 
probability: 



G(w,W;x) = 

w+W^ ^n= 



-i^(^fw)^{ 



wk(w+x 

w+W 



o 2 V w + W J 



The derivative of G in respect to w and FT": 

d 2 G(w, W; x) 



g(w,W;x) 



dwdW 



(A4) 



(A5) 



is the mutual probability density (in respect to w and 
W) of a given RW, ending at a position x, to have a 
minimum — w and a maximum W. This derivation of 
g(w, W; x) can be repeated for a continuous Gaussian 
RW with step size a, and therefore the mutual proba- 
bility density q{w, W;x) is also valid for Gaussian RWs. 
Fig. [| in section [JTJ| depicts this probability density. 



APPENDIX B: PROBABILITY OF AN 
'INTERSECTION' BETWEEN RANDOM WALKS 

We present the derivation of the probability of an 'in- 
tersection' between two RWs, that are sub-walks of one 
RW with steps of fixed length. Such an intersection in- 
dicates that a loop is formed between the two sub-walks 
(see Fig. H in section IV). The probability of a loop is 
derived for an infinitely long RW, where each sub-walk 
can be treated within the Gaussian statistics. We use 
the notations of section IV - the number of steps in each 



sub- walk is r, the size of each step is a, there are kt steps 
of the RW between the two sub-walks, the end position 
of the first sub- walk is x, and the second sub- walk begins 
at a position s relative to the origin of the first sub-walk. 
Without loss of generality we can suppose that s > 0. 



The two sub-walks form a loop when the maximal coor- 
dinate of the first walk is greater or equal to the minimal 
coordinate of the second walk (labeled z) . 

The probability density M(r, r) of the maximal coordi- 
nate of a RW after r steps can be shown to be (for r > 0) 
twice the probability density of the position of the RW af- 
ter t steps [0|. For r < the probability density M(r, r) 
vanishes, since the maximum of a RW cannot be lower 
than its origin. By reflecting each RW about its origin 
(replacing +a with —a and vise versa), we see that for 
every RW having a maximum position of r, there is a re- 
flected RW having a minimum position of — r. Therefore, 
the probability density m(r, r) of the minimal coordinate 
of a RW after r steps equals M(—r, r). We thus get: 



M[r, t) = m(—r, r) 




a^/2-KT 



for r > 
for r < 

(Bl) 



where f(r, r) is the probability of a RW to be at a posi- 
tion r after r steps. 

In order to have a loop between the two sub- walks, 
three independent events must occur: 



(1) 



The maximal coordinate of the first sub- walk must 
be greater or equal to the minimal coordinate of 
the second sub-walk. The probability of a RW to 
reach a position x and to have a maximum greater 
than z after r steps is given by [[fl[ : 



Pi(x,z) 



f(2z-x,r) 

f(x,r) 



for z > max(0, x) 
otherwise 



(B2) 



(2) There is a RW between the end position of the first 
sub-walk and the origin of the second sub- walk, i.e. 
a RW of kt steps and a total displacement of s — x. 
However, the position s is fixed, and therefore this 
RW is restricted by the existence of a RW of kt + t 
steps from the origin to s. This probability is given 
by 



Fi{a,x) = 



J{s-x,kt) 
/(*,(« +1)t) ■ 



(B3) 



(3) The minimum of the second sub- walk is equal to z, 
i.e s — z relat ive to its origin. This probability is 
given by Eq. (Bl): 



P 3 {s,z)=m(z-s,T)-- 



2/(s- 




■z, t), for z < s 
, for z > s 



(B4) 



The probability of a loop, denoted by Prw( k , A) (for 
given s,a,T and re, where A = s/a^/r), is obtained by 
integration of P\ ■ P 2 ■ P3 over all values of z and x: 



i\w(«,A) 



P 1 P 2 P 3 dzdx 



(B5) 



00 ./ x— — 00 



From the definitions of Pi and P3 we see that if z < 
then Pi = f(x, t). If < z < s then Pi = f(x, r) when 
x > z, and if z > s then P3 vanishes. Therefore we get: 



Prw(k, A)= / dzP 3 (s,z) / dxf(x,T)P 2 (s,x) 

J— OO </ — oc 

+ / dzP 3 (s,z) / da;Pi(a;,z)P2(s,a;) (B6) 

JO J-00 

/■S /"OC 

+ / dzP 3 (s,z) / dxf(x,r)P 2 (s,x) . 



The resulting probability density is depicted in Fig.M in 



section 



IV 



[1] T. E. Creighton, Proteins: Structures and Molecular 
Properties, 2nd edn. (W. H. Freeman and Company, New 
York, 1993); P. G. de Gennes, Scaling Concepts in Poly- 
mer Physics (Cornell University Press, Ithaca, New York, 
1979); A. Y. Grosberg and A. R. Khokhlov, Statistical 
Physics of Macromolecules (AIP Press, New York, 1994). 

[2] C. Tanford, Physical Chemistry of Macromolecules (John 
Wiley and Sons, New York, 1961). 

[3] S. F. Edwards, P. R. King and P. Pincus, Ferroelectrics 
30, 3 (1980); P. G. Higgs and J. F. Joanny, J. Chem. 



M 



[5] 

[6] 
[7] 

[8] 

[9] 
[10] 



[11] 



[12] 
[13] 



Phys. 94, 1543 (1991); Y. Kantor and M. Kardar, Eu- 
rophys. Lett. 14, 421 (1991); Y. Kantor, H. Li and M. 
Kardar, Phys. Rev. Lett. 69, 61 (1992); Y. Kantor, M. 
Kardar and H. Li, Phys. Rev. E 49, 1383 (1994); J. 
Wittmer, A. Johner and J. F. Joanny, Europhys. Lett. 
24, 263 (1993); D. Bratko and A. M. Chakraborty, J. 
Phys. Chem. 100, 1164 (1996); N. Lee and S. Obukhov, 
Eur. Phys. J. B 1, 371 (1998); T. Soddemann, H. Schies- 
sel and A. Blumen, Phys. Rev. E 57, 2081 (1998). 
Y. Kantor and M. Kardar, Europhys. Lett. 27, 643 
(1994); Y. Kantor and M. Kardar, Phys. Rev. E 51, 1299 
(1995); Y. Kantor and M. Kardar, Phys. Rev. E 52, 835 
(1995). 

A. V. Dobrynin, M. Rubinstein and S. P. Obukhov, 
Macromolecules 29, 2974 (1996). 

Y. Kantor and D. Erta§, J. Phys. A27, L907 (1994). 
D. Ertas, and Y. Kantor, Phys. Rev. E 53, 846 (1996). 
D. Ertas. and Y. Kantor, Phys. Rev. E 55, 261 (1997). 
S. Woffling and Y. Kantor, Phys. Rev. E 57, 5719 (1998). 

B. Derrida, R. B. Griffiths and P. G. Higgs, Europhys. 
Lett. 18, 361 (1992); B. Derrida and P. G. Higgs, J. 
Phys. A27, 5485 (1994); B. Derrida and H. Flyvbjerg, 
J. Phys. A20, 5273 (1987); B. Derrida and H. Flyvbjerg, 
J. Phys. A19, L1003 (1986); G. F. Lawler, Duke Math. 
J. 47, 655 (1980); G. F. Lawler, Duke Math. J. 53, 249 
(1986); G. F. Lawler, J. Stat. Phys. 50, 91 (1988); A. J. 
Guttmann and R. J. Bursill, J. Stat. Phys. 59, 1 (1990); 
R. E. Bradley and S. Windwer, Phys. Rev. E 51, 241 
(1995); D. Dhar and A. Dhar, Phys. Rev. E 55, 2093 
(1997). 

W. Feller, An Introduction to Probability Theory and Its 
Applications, 3rd edn. (John Wiley and Sons, New York, 
1968), Vol. 1. 

S. Chandrasekhar, Rev. Mod. Phys. 15, 1 (1943). 
S. Karlin, A First Course in Stochastic Processes (Aca- 
demic Press, New York, 1966). 



